The cross section for production of Z bosons with two associated b-jets, and Z decaying to leptons,
can be measured at the LHC with about 100 pb$^{-1}$ of data. We use simulated data to study possible
the total backgrounds to several of the Higgs discovery channels at the LHC.

1. Introduction

At the LHC, $Z/\gamma^*$ can be produced in association with two b-quarks, $gg/q\bar{q} \to b\bar{b}Z$, with a
significant cross section [1], [2], [3]. The measurement of this process is an important test of QCD
calculations and can also help to reduce the uncertainty of the supersymmetric (SUSY) $b\bar{b}H$ production
cross-section calculation [4], since the two processes proceed via similar partonic initial
states. This process is also a background to Higgs boson discovery channels at the LHC,
such as the Standard Model $H \to ZZ \to 4\ell$ and the associated production of SUSY Higgs bosons, $b\bar{b}\phi$, $\phi \to \tau\tau\,(\mu\mu)$
($\phi = h, H, A$). In the following, we describe the possibility of observing $b\bar{b}Z$ production and measuring
its cross section at the LHC using 100 pb$^{-1}$ of early data from the CMS experiment,
as studied in [5]. A related but different cross section, for $p\bar{p} \to Z$ + b-jet,
has previously been measured at the Tevatron by both D0 and CDF [6], [7].



2. Monte Carlo Signal and Background Samples

The signal events $\ell\ell b\bar{b}$ ($Zb\bar{b}$) were generated at parton level using the leading-order (LO)
CompHEP generator [8] and hadronized with PYTHIA [9], with the following generator-level cuts:
$p_T^b > 10$ GeV/c, $|\eta_b| < 10$, $m_{\ell\ell} > 40$ GeV/c$^2$ and $|\eta_\ell| < 2.5$. The next-to-leading-order (NLO)
cross section $\sigma(\ell\ell b\bar{b}) = 45.9$ pb ($\ell = e, \mu, \tau$) was calculated with the MCFM program [10], applying
the same generator-level cuts. The CTEQ6M parton density functions (PDFs) and the scale $\mu_R = \mu_F = M_Z$
were used. The backgrounds considered for this process were Drell-Yan $Z/\gamma^* \to \ell\ell$ ($\ell = e, \mu, \tau$)
production in association with two or more light-quark and gluon jets (Z+jets), $\ell\ell c\bar{c}$+jets
($Zc\bar{c}$+jets) and $t\bar{t}$+jets. Background samples were generated at leading order with the
ALPGEN generator [11]. The $t\bar{t}$+jets ALPGEN events were normalized to the NLO inclusive $t\bar{t}$
cross section of 840 pb [12]. The $Zc\bar{c}$+jets events were normalized to the NLO MCFM cross section
for $\ell\ell c\bar{c}$, 13.29 pb, applying the ALPGEN production cuts: $p_T^c > 20$ GeV/c, $|\eta_c| < 5$, $m_{\ell\ell} > 40$
GeV/c$^2$. The same PDF and scale settings as for the $\ell\ell b\bar{b}$ process were used. The Z+jets events were
normalized to the NLO MCFM cross section for $\ell\ell$ + 2 jets, 714 pb, applying the ALPGEN production
cuts: $p_T^j > 20$ GeV/c, $|\eta_j| < 5$, $m_{\ell\ell} > 40$ GeV/c$^2$. Signal and background samples were passed
through the full simulation and reconstruction chain of CMS, under an imperfect calibration and
alignment configuration assumed to be typical of the first 100 pb$^{-1}$ of data at CMS.
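
As a rough cross-check of the normalization, the quoted cross sections can be converted into expected numbers of produced events for 100 pb$^{-1}$ through $N = \sigma \times L$. The sketch below is only illustrative: the dictionary keys and the helper name `expected_events` are not taken from the original study, each cross section is valid only within its own generator-level cuts, and trigger, acceptance and selection efficiencies are deliberately ignored.

```python
# Expected event yields before any selection, N = sigma * L.
# Cross sections (in pb) are the NLO values quoted in the text; the
# names below are illustrative and not part of the original analysis.

NLO_CROSS_SECTIONS_PB = {
    "Zbb": 45.9,        # llbb, MCFM, signal generator-level cuts
    "Zcc+jets": 13.29,  # llcc, MCFM, ALPGEN production cuts
    "Z+jets": 714.0,    # ll + 2 jets, MCFM, ALPGEN production cuts
    "tt+jets": 840.0,   # inclusive ttbar
}

INTEGRATED_LUMINOSITY_PB_INV = 100.0


def expected_events(cross_section_pb, luminosity_pb_inv):
    """Return the expected number of produced events, sigma * L."""
    return cross_section_pb * luminosity_pb_inv


yields = {
    name: expected_events(xsec, INTEGRATED_LUMINOSITY_PB_INV)
    for name, xsec in NLO_CROSS_SECTIONS_PB.items()
}

for name, n_events in yields.items():
    print(f"{name:10s} {n_events:10.0f}")
```

Under these assumptions the signal sample corresponds to roughly 4590 produced events, against about 71400 Z+jets and 84000 $t\bar{t}$ events, which illustrates why a tight selection is needed.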



3. Event Selection 



The events were selected by the CMS Level-1 (L1) and High-Level Trigger (HLT) single isolated
electron and muon triggers designed for the low-luminosity period ($L = 10^{32}$ cm$^{-2}$ s$^{-1}$). Offline,
events with at least two isolated, oppositely charged electrons or muons with $p_T$ greater than
20 GeV, $|\eta_e| < 2.5$ and $|\eta_\mu| < 2.0$ were selected to form a Z boson candidate. Events were also required
to have two or more jets, reconstructed using the iterative cone algorithm with a cone size of 0.5, with corrected
jet $E_T$ greater than 30 GeV and jet $|\eta| < 2.4$. The $\eta$ cut ensured good-quality b-tagging.
Figure 1 (left) shows the distribution of the di-lepton invariant mass for the electron final-state events
with at least two jets passing the lepton and jet selections.

Figure 1: Left: Di-lepton invariant mass distribution for events passing lepton and jet selections as described
in the text. Right: Di-lepton invariant mass for events after all selections except b-tagging.

b-tagging is an important and effective
tool to ensure the purity of $Zb\bar{b}$ events and to reduce the Z+jets and $Zc\bar{c}$+jets backgrounds, since
the b-tag discriminator values for light-quark, gluon and c-quark jets tend to be lower than
those for b-quark jets. The events were double b-tagged using 'track counting' b-tagging,
which uses the 3D impact parameter significance of the track with the third-highest significance in
the jet as the b-tagging discriminator. A b-tag discriminator threshold of 2.5 was used in the
analysis, which ensured a high purity of b-tagged jets. A selection on the
missing transverse energy in the event, $E_T^{miss}$, is used to suppress the $t\bar{t}$+jets background,
since $t\bar{t}$+jets events contain neutrinos from $W \to \ell\nu$ decays and have higher $E_T^{miss}$ than
$Zb\bar{b}$ events. Events are required to have $E_T^{miss}$ less than 50 GeV, where $E_T^{miss}$ is
reconstructed using calorimeter tower information and calibrated using jet energy corrections and
muon corrections. The di-lepton invariant mass distributions of the signal and the various background
events passing all selection criteria are shown in Figure 2. The events are scaled to 100 pb$^{-1}$ of
data.
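
The selection described above can be summarized as a short sketch. The event layout (dictionaries with `leptons`, `jets` and `met` entries) and all function names are hypothetical, and the same-flavour requirement on the lepton pair is an assumption made here for forming a Z candidate; the di-lepton mass window is not applied in this sketch.

```python
# Sketch of the offline selection applied to a toy event record.
# The event layout and function names are hypothetical.

LEPTON_PT_MIN = 20.0                     # GeV
LEPTON_ETA_MAX = {"e": 2.5, "mu": 2.0}   # |eta| limits per flavour
JET_ET_MIN = 30.0                        # GeV, corrected jet E_T
JET_ETA_MAX = 2.4
BTAG_DISCRIMINATOR_MIN = 2.5             # track-counting discriminator
MET_MAX = 50.0                           # GeV


def good_leptons(event):
    """Isolated leptons passing the pT and |eta| requirements."""
    return [
        lep for lep in event["leptons"]
        if lep["isolated"]
        and lep["pt"] > LEPTON_PT_MIN
        and abs(lep["eta"]) < LEPTON_ETA_MAX[lep["flavour"]]
    ]


def has_z_candidate(leptons):
    """True if a same-flavour, oppositely charged pair exists (assumption)."""
    for i, first in enumerate(leptons):
        for second in leptons[i + 1:]:
            if (first["flavour"] == second["flavour"]
                    and first["charge"] * second["charge"] < 0):
                return True
    return False


def good_jets(event):
    """Jets passing the corrected E_T and |eta| requirements."""
    return [
        jet for jet in event["jets"]
        if jet["et"] > JET_ET_MIN and abs(jet["eta"]) < JET_ETA_MAX
    ]


def passes_selection(event):
    """Lepton pair, two jets, double b-tag and missing-E_T veto."""
    jets = good_jets(event)
    n_btagged = sum(1 for jet in jets if jet["btag"] > BTAG_DISCRIMINATOR_MIN)
    return (
        has_z_candidate(good_leptons(event))
        and len(jets) >= 2
        and n_btagged >= 2
        and event["met"] < MET_MAX
    )


# Toy events with invented values, for illustration only.
signal_like = {
    "leptons": [
        {"flavour": "mu", "charge": +1, "pt": 45.0, "eta": 0.3, "isolated": True},
        {"flavour": "mu", "charge": -1, "pt": 38.0, "eta": -1.1, "isolated": True},
    ],
    "jets": [
        {"et": 55.0, "eta": 0.8, "btag": 4.1},
        {"et": 42.0, "eta": -1.5, "btag": 3.0},
    ],
    "met": 12.0,
}
ttbar_like = dict(signal_like, met=85.0)

print(passes_selection(signal_like), passes_selection(ttbar_like))
```

The second toy event differs from the first only in its missing transverse energy, so it fails only the $E_T^{miss}$ requirement.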



[Figure 2 plot. Legend: Zbb, Z+jets, Zcc+jets, tt+jets, 100 pb$^{-1}$ data. x-axis: $M_{\ell\ell}$ (GeV/c$^2$), 20 to 200. L = 100 pb$^{-1}$. CMS Preliminary.]

Figure 2: Di-lepton invariant mass for events after all selections. The events are normalized to 100 pb$^{-1}$
of data. The dots are the result of one random MC experiment with 100 pb$^{-1}$ of data. Nearly 46 events are
expected in the mass range 75-105 GeV with S/B ~ 3.6.

4. Background Estimation and Systematic Uncertainties

The contribution of the $t\bar{t}$+jets background events in the signal region of the di-lepton invariant
mass distribution can be estimated from the data by extrapolating the sidebands of the di-lepton
invariant mass spectrum. The $t\bar{t}$+jets background is estimated using
the relation $N_Z(t\bar{t}) = (\varepsilon_Z(t\bar{t}) / \varepsilon_{noZ}(t\bar{t})) \times N_{noZ}(t\bar{t})$, where $N_Z(t\bar{t}) = 8$ is the expected number of
$t\bar{t}$+jets events in the signal region, within the $m_{\ell\ell}$ mass window of 75-105 GeV/c$^2$, $N_{noZ}(t\bar{t}) = 21$ is
the measured number of $t\bar{t}$ events outside the signal region, $\varepsilon_Z(t\bar{t})$ is the selection efficiency for
$t\bar{t}$+jets events in the signal region, and $\varepsilon_{noZ}(t\bar{t})$ is the selection efficiency for $t\bar{t}$+jets events outside the
signal region. $\Delta N_Z(t\bar{t})$ is the uncertainty of the expected number of $t\bar{t}$+jets events in the signal region,
given by $\Delta N_Z(t\bar{t}) / N_Z(t\bar{t}) = 1/\sqrt{N_{noZ}(t\bar{t})}$. The uncertainty on the ratio $\varepsilon_Z(t\bar{t}) / \varepsilon_{noZ}(t\bar{t})$ is negligible
in comparison to the statistical uncertainty of the number of events outside the signal region.
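
The sideband extrapolation can be illustrated with the numbers quoted above. The text does not quote the efficiency ratio itself, so the value used below is simply the one implied by $N_Z(t\bar{t}) = 8$ and $N_{noZ}(t\bar{t}) = 21$; the function name is hypothetical, and the uncertainty on the ratio is neglected as stated in the text.

```python
import math


def sideband_estimate(n_sideband, efficiency_ratio):
    """Extrapolate a sideband count into the signal region.

    Returns (estimate, absolute uncertainty), where the relative
    uncertainty is 1 / sqrt(n_sideband); the uncertainty on the
    efficiency ratio is neglected.
    """
    estimate = efficiency_ratio * n_sideband
    relative_uncertainty = 1.0 / math.sqrt(n_sideband)
    return estimate, estimate * relative_uncertainty


N_NO_Z = 21                    # ttbar events outside the signal region
EFFICIENCY_RATIO = 8.0 / 21.0  # implied by the quoted counts, not quoted itself

n_z, delta_n_z = sideband_estimate(N_NO_Z, EFFICIENCY_RATIO)
print(f"N_Z(tt) = {n_z:.1f} +/- {delta_n_z:.1f}")
```

With 21 sideband events the relative uncertainty is $1/\sqrt{21}$, about 22%, which corresponds to slightly less than two events on the estimate of 8.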

The following equations are used to determine the number of $Zb\bar{b}$ events ($N_{Zb\bar{b}}$) and its
uncertainty $\Delta N_{Zb\bar{b}}$ for 100 pb$^{-1}$, after all selections except double b-tagging:

$N^{\text{before b-tag}} = N_{Zjj} + N_{Zc\bar{c}} + N_{Zb\bar{b}}$    (4.1)
$N^{\text{after b-tag}} = \varepsilon_l \times N_{Zjj} + \varepsilon_c \times N_{Zc\bar{c}} + \varepsilon_b \times N_{Zb\bar{b}}$    (4.2)

where $N^{\text{before b-tag}} = 4644$ is the measured number of $Z/\gamma^* \to \ell\ell$ events in the di-lepton mass
window between 75 and 105 GeV/c$^2$ after all selections have been applied except double b-tagging.
It receives a negligible contribution (less than 1%) from the $t\bar{t}$ background, as can be seen from
Figure 1 (right). $N^{\text{after b-tag}} = 38$ is the measured number of $Z/\gamma^* \to \ell\ell$ events after all selections
including double b-tagging, obtained after subtraction of the $t\bar{t}$ background described above.
$N_{Zjj}$ is the unknown number of $\ell\ell$+jets (jet = u, d, s, g) events before double b-tagging, $N_{Zc\bar{c}}$ is the
unknown number of $\ell\ell c\bar{c}$ events before double b-tagging, and $N_{Zb\bar{b}}$ is the unknown number of $\ell\ell b\bar{b}$
events before double b-tagging. $\varepsilon_b$ is the efficiency of double b-tagging for $Zb\bar{b}$ signal events,
determined from the Monte Carlo simulation tuned to reproduce the b-tagging efficiency
measured from the data. $\varepsilon_c$ is the efficiency of double b-tagging for $Zc\bar{c}$+jets background events,
determined from the Monte Carlo simulation. $\varepsilon_l$ is the efficiency of double b-tagging for Z+jets
(jet = u, d, s, g) background events, determined from the Monte Carlo simulation tuned to
reproduce the mistagging efficiency measured from the data.

The number of unknown parameters is reduced to two, following the D0 analysis approach
[6], by using the theoretical ratio of cross sections and the ratio of selection efficiencies,

$R = \frac{\sigma(Zc\bar{c})}{\sigma(Zjj)} \times \frac{\varepsilon_{Zc\bar{c}}}{\varepsilon_{Zjj}} = 0.046 \pm 0.002$    (4.3)

and by replacing $N_{Zc\bar{c}}$ by $R \times N_{Zjj}$. The uncertainties of this ratio due to the $\mu_R$, $\mu_F$ scale variation
and the JES and MET scales are considered. The equations are then solved for $N_{Zb\bar{b}}$.
The uncertainty on $N_{Zb\bar{b}}$, $\Delta N_{Zb\bar{b}}$, is then calculated from the uncertainty of $N^{\text{after b-tag}}$ due to the
$t\bar{t}$ background subtraction ($\delta N^{\text{after b-tag}} = 4.0\%$), the uncertainty of $R$ and the uncertainties of $\varepsilon_b$,
$\varepsilon_c$ and $\varepsilon_l$. From the estimated number of $Zb\bar{b}$ events after all selections except b-tagging, the
number of $Zb\bar{b}$ events before the lepton and jet selections and the cut on $E_T^{miss}$ can be evaluated. The
systematic uncertainties due to the jet energy scale, the $E_T^{miss}$ scale, the lepton selections, b-tagging, the Monte
Carlo dependence on the $p_T$ and $\eta$ of jets, and the luminosity measurement have been taken into account
according to the values expected to be achieved in CMS with data corresponding to 100 pb$^{-1}$ of
integrated luminosity.
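
Once the charm yield is replaced by $R$ times the light-jet yield, the two event-count equations (before and after double b-tagging) form a linear system in the light-jet and $Zb\bar{b}$ yields that can be solved in closed form. The sketch below shows one way to do this. The double b-tagging efficiencies are not quoted in the text, so the values in the closure check are placeholders chosen purely for illustration, as are the function and variable names.

```python
def solve_zbb(n_before, n_after, r, eff_b, eff_c, eff_l):
    """Solve the before/after b-tag equations with N_Zcc = R * N_Zjj.

    N_before = (1 + R) * N_Zjj + N_Zbb
    N_after  = (eff_l + R * eff_c) * N_Zjj + eff_b * N_Zbb

    Returns (N_Zjj, N_Zbb).
    """
    # Effective double-tag efficiency of the combined Zjj + Zcc component.
    eff_non_b = (eff_l + r * eff_c) / (1.0 + r)
    n_zbb = (n_after - eff_non_b * n_before) / (eff_b - eff_non_b)
    n_zjj = (n_before - n_zbb) / (1.0 + r)
    return n_zjj, n_zbb


# Closure check with invented inputs: build the two observables from
# assumed true yields, then verify that the solver recovers them.
R = 0.046                                 # ratio quoted in the text
EFF_B, EFF_C, EFF_L = 0.09, 0.005, 0.0005  # placeholder efficiencies
TRUE_ZJJ, TRUE_ZBB = 4000.0, 400.0         # placeholder yields

true_zcc = R * TRUE_ZJJ
n_before = TRUE_ZJJ + true_zcc + TRUE_ZBB
n_after = EFF_L * TRUE_ZJJ + EFF_C * true_zcc + EFF_B * TRUE_ZBB

solved_zjj, solved_zbb = solve_zbb(n_before, n_after, R, EFF_B, EFF_C, EFF_L)
print(f"N_Zjj = {solved_zjj:.1f}, N_Zbb = {solved_zbb:.1f}")
```

Uncertainties on $R$ and on the efficiencies could then be propagated by repeating the solution with shifted inputs, although the source does not state which propagation method was used.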

The statistical uncertainty is defined as $\Delta N_{sel} / N_{sel} = 1/\sqrt{N_{sel}}$, where $N_{sel}$ is the measured
number of events after all selections (46); thus $\delta N_{sel} = 14.7\%$.
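
The quoted statistical uncertainty can be checked numerically. The split into signal and background below additionally assumes that the approximate S/B of 3.6 from Figure 2 applies to the same 46 events; the variable names are illustrative.

```python
import math

n_selected = 46   # events after all selections, from the text
s_over_b = 3.6    # approximate S/B quoted for Figure 2

# Poisson relative uncertainty on the selected event count.
relative_stat_uncertainty = 1.0 / math.sqrt(n_selected)

# Approximate signal and background content implied by S/B.
n_signal = n_selected * s_over_b / (1.0 + s_over_b)
n_background = n_selected / (1.0 + s_over_b)

print(f"stat. uncertainty: {100 * relative_stat_uncertainty:.1f}%")
print(f"S ~ {n_signal:.0f}, B ~ {n_background:.0f}")
```

The result, about 14.7%, rounds to the 15% statistical term quoted for the cross-section measurement.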

Finally, the $\ell\ell b\bar{b}$ ($\ell = e, \mu$) cross section is expected to be measured with an accuracy of

$\delta\sigma = \pm\,[\ldots]\,(\text{syst.}) \pm 15\%\,(\text{stat.})$    (4.4)

5. Conclusion 

The process $pp \to b\bar{b}Z$ at the LHC needs to be well understood because of its importance as a
background to SM and SUSY Higgs boson discovery channels. The possibility of measuring the
$b\bar{b}Z$, $Z \to \ell\ell$ process at CMS with the first 100 pb$^{-1}$ of data has been studied with a robust selection
of leptons and b-jets. Possible methods of estimating backgrounds from data, together with the
statistical and systematic uncertainties, have been discussed.